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The  Large  Deviation  Principle  for  the  Sairple  Average  Process 
and  Functional  Erdos -Renyi  Laws 

Abstract 

The  large  deviation  principle  is  established  for  the  sairple  average 
process.  The  principle  is  then  used  to  obtain  functional  counterparts 
of  the  Erdos-Renyi  type  laws  of  Erdos  and  Renyi  (1970)  and  Shepp  (1964) . 


1.  Introduction.  Let  Xp  X2,...  be  i.i.d.  random  variables  with 
E(Xj)  =  0  and  a2  =  EQCj2)  <  «.  Let  P(t),  t*0,  denote  the  random  polygon 
where  P(n)  =  Sn  =  X^  +  ...  +  with  P(0)  =  SQ  =  0  and  is  linear  between 
the  integers. 

The  classical  functional  laws  examine  the  asymptotic  behavior  of  segments 

of  the  polygon  originating  at  the  origin  upon  various  renormalizations  of 

the  space  and  time  axes.  In  particular,  let  Dn(t)  ■  P(tn)/o^5T  and 

2 

Sn(t)  =  P(tn)/ Amloglcgn/5 .  Let  J(a)  =  a  /2  which  is  the  large  deviation 
rate  (Section  2)  for  the  normal  distribution.  Let  1(f)  =  ^J(f)dt  if  f  is 
absolutely  continuous  (f  denotes  the  Radon-Nikodym  of  f)  and  =  »  if  not. 

Let  ra  ■  (f:I(f)  ia).  Then,  the  invariance  principles  of  Donsker  (see 
Billingsley  (1968)  and  Strassen  (1964))  state  that  {D^(t) :0<t<l}  -*■  (W(t) :0stsl} 
in  distribution,  W  (•)  the  standard  Brownian  Motion,  and  that  the  set  of  cluster 
points  of  {S  (t):0<t^l)  is  r  a.s..  Here  D  (■)  and  Sf*)  are  both  viewed 

II  a  Jl  II 

as  random  functions  on  C[0,1]  with  the  usual  norm  topology. 

The  two  functional  law's  proved  here  examine  the  asymptotic  behavior  of 
segments  of  P(t)  that  move  off  toward  infinity,  i.e.,  moving  averages.  These 
laws  are  the  functional  counterparts  of  the  Erdds-Renyi  (E-R)  type  laws  of 
Erdos  and  RSnyi  (1970)  and  of  Shepp  (1964)  just  as  the  Donsker  and  the 
Strassen  Invariance  Principles  are  the  functional  analogues  of  the  central 
limit  theorem  and  the  law  of  the  iterated  logarithm.  Note,  though,  that 
these  E-R  type  functional  laws  are  not  invariance  principles  since  the  large 
deviation  rate  of  Xp  which  uniquely  determines  the  distribution  of  Xp  can 
be  calculated  from  the  conclusions  of  these  laws. 
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2 

To  state  these  results  we  need  the  following.  For  m  and  n  positive 
integers  and  a>0,  let  ^(s)  *  P(m+s)  -  P(m)  and  let  A^  n  a(s)  = 

Am(sCa  Hogn3)/Ca  *logn]  where  Cxi  indicates  the  integer  part  of  x.  We 
futher  assume  that  the  m.g.f.  of  X^  is  finite  in  seme  open  interval  about  0. 

Let  J  denote  the  large  deviation  rate  for  X^  (see  2.10).  Let 
ra^  *  (b:  J(b)  sa).  Then  ra^  is  an  interval  with  -»  <  b^  <  b2  <  ®  since 

E(Xj)  =  0  and  the  m.g.f.  is  finite  in  an  open  interval  about  0. 

With  Am  n  a  =  Ap  n  a(l)  we  now  state  the  classical  E-R  type  laws  in  a 
manner  that  contrasts  them  with  their  functional  counterparts .  The 
Erdos-Renyi  Law,  as  stated  below,  is  stronger  than  the  original  statement 
and  is  along  the  lines  of  that  given  in  Theorem  5  of  Acosta  and  Kuelbs  (1983) 
for  moving  averages  of  i.i.d.  random  variables  taking  values  in  a  Banach 
space. 

Shepp's  Law.  The  set  of  cluster  points  of  (a  }  is  r  J  =  tb,,b9]. 

n,n,a  3  x*  z 

In  particular  lbn  4n>n>?  =  bj  and  Tin  4^,  -  b2  a.s.. 

For  a  set  A  and,  e  >o,  let  A^  =  (b:  jb-a|  <  e  for  some  aeA). 

The  Erdos-Renyi  Law.  Let  e >  o.  Then  with  probability  1, 

(Am  n  a  :  msn)  c  (raJ)e  eventually 
and 

ra  c  ^Am,n,a  ’  eventually. 

In  particular,  the  set  of  cluster  points  of  the  triangular  array  (A_  : m s xl) 

m,n  ^3 

is  ra  = 

Let  I  (*)  denote  the  rate  function  for  the  sample  average  process. 

This  is  defined  in  terms  of  J(»)  in  Section  3.  Let  r  1  *  r  =  (f:I(f)  sa} 
which  is  "contact"  (Theorem  3.1,  Definitions  2.1  and  2.2)  and  convex 
(definition  of  I  from  the  convexity  of  J). 
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In  Section  4  we  establish  the  following  functional  counterparts  of 
Shepp’s  and  of  Erdos  and  Renyi’s  laws: 

(1.1)  The  set  of  cluster  points  of  {Aj,  n  a(*))  is  ra* 

(1.2)  Let  e>  o.  V'/ith  probability  1, 

:msn}  c  ra,e  eventually 


ra  <=  (A„  „  „(•)  :  msn}  . 
a  m,n,a  e 

In  particular,  the  set  of  cluster  points  of  the  triangular  array 

Remark  1.1.  Since  ra  is  "compact",  we  have  from  (1.1)  that  the  set 
of  cluster  points  of  H(A_)  is  H(rv)  and  Tun  H(a  _(•))  =  sup  H(f)  <® 

n  a  Ti,n,a  r  gt» 

a 

for  any  "continuous"  function  H.  In  particular,  if  H(f)  =  f(l),  then  the 
set  of  cluster  points  of  (A„  „  „>  is  (f(l)  :  fel\)  and  Bin  A  ■  sup  f(l) 

n,n,d  a  11,11, a 

which  is  a  restatement  of  Shepp's  lav?.  A  similar  remark  can  be  made  con¬ 
cerning  the  Erdos -Rcnyi  law. 

The  original  Erdos-Renyi  Law,  which  states  that  lim  ma^c  n  a  =  ^2 
and  lim  A^  n  ,a  =  ^1  a's>  ^oes  not  seem  be  a  direct  consequence  of 
the  functional  law  (1.2). 

Remark  1.2.  The  issues  of  what  space  in  which  A(*)  takes  its  values 
and  with  what  topology  have  been  ignored  here.  These  will  be  addressed  in 
Sections  3  and  4. 

The  key  to  proving  (1.1)  and  1.2)  is  to  establish  the  "large  deviation 
principle"  (LDP)  for  ^(t)  ■  P(nt/n) .  Roughly  speaking,  the  principle  says 
that  PQ^(*)ef*df)  *  e"n*^df.  This  provides  suitable  approximations  for 
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certain  probability  statements  about  the  sample  average  process,  "5(^(0  • 

Once  such  approximations  are  established,  the  proofs  of  (1.1)  and  (1.2)  avc 
essentially  those  of  Shepp  and  of  Erdos  and  Renyi  with  modifications  taken 
from  Acosta  and  Kuebb.  This  is  done  in  Section  4.  A  review  concerning  the 
LDP  is  given  in  Section  2  while  the  LDP  is  established  for  X^O  in  Section  3. 

2.  The  Large  Deviation  Principle.  Let  X  be  a  topological  space  and  F 
be  the  Borel  c-field  in  X.  Let  (Pn}be  a  family  of  probability  measures  on 
(X, F) .  Here  we  review  some  general  ideas  regarding  large  deviations  for  {Pn> 
while  in  the  next  section  we  are  interested  in  results  specifically  for  Pn, 
the  probability  measure  induced  by  the  sample  average  process  "5^(0* 

The  following  definitions  which  are  slight  variants  of  Varadhan  (1984) 
allow  us  to  state  many  large  deviation  results  in  a  concise  form. 

Definition  2.1.  A  function  1(0  on  X  is  said  to  be  a  regular  rate 
function  if 

(2.1)  0sl(x)s», 

(2.2)  I(*)  is  lower  semi -continuous  (lsc)  and 

(2.3)  for  each  c<®,  rc  *  (x:I(x)  sc)  is  compact. 

For  any  subset  A  of  X,  define 

(2.4)  1(A)  *  inf  I (x) . 

XeA 

Definition  2.2.  The  measures  (Pn>  satisfy  the  large  deviation  principle 
(LDP  or  LD  principle)  with  rate  function  1(0  if 

(2.5)  1(0  is  a  regular  rate  function, 
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(2.6)  for  each  closed  set  F, 

Tim  -  log  Pn(F)  <  -  1(F),  and 

(2.7)  for  each  open  set  G, 
lim  J  log  Pn(G)>-  1(G). 

Definition  2.3.  The  measures  (Pr>  satisfy  the  weak  large  deviation 
principle  (WLDP  or  the  weak  LD  principle)  with  rate  function  I(°)  if  (2.5) 
and  (2.7)  of  Definition  2.2  together  with  (2.8)  below  are  satisfied: 

(2.8)  for  each  compact  set  K,  Tin  ^  log  P  (K)  s  -  I(K). 

Definition  2.4.  The  measures  {Pn>  are  large  deviation  tight  (LD  tight) 
if,  for  each  M>®,  there  exists  a  compact  set  such  that 

(2.9)  lm  i  log  Pn(KMc)  <  -  M. 

The  following  lemma  whose  proof  is  simple  and  left  to  the  reader  shows 
the  usefulness  of  LD  tightness. 

Lemma  2.5.  Let  (P„)  be  LD  tight  and  satisfy  the  VLDP.  Then  it  satis- 
fies  the  LDP. 

Many  interesting  applications  in  large  deviations  occur  when  X  is  a 
Polish  space,  that  is  a  separable  complete  metric  space.  This  is  the  setting 
in  Section  3.  In  this  context,  two  important  and  immediate  derivatives  of 
the  LDP  are  the  contraction  principle  and  the  asyrptotic  expression  for  cer¬ 
tain  intergrals.  This  latter  result  is  not  used  in  the  sequel  but  is  stated 
below  for  completeness.  For  proofs  see  Varadhan  (1966,  1984). 

Let  {Pn>  satisfy  the  LDP  with  rate  function  I(x).  Let  h  be  a  continuous 
map  from  X  into  another  Polish  space  V,  and  let  Qn  *  Pnh_1. 
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Contraction  Principle.  The  measures  {Qn>  satisfy  the  LDP  with  rate 
function 


K(y)  *  .  inf  I(x). 
x:h(x)=y 

Asymptotic  expression  for  certain  integrals.  Let  F  be  a  bounded  real 
valued  continuous  function  on  X.  Then 

J-log  /  exp  (XF(x))  dPn(x)->sup  [F(x)-I(x)3. 

The  LDP,  along  with  the  above  two  results,  has  the  flavor  of  weak 
convergence  of  probability  measures  (Theorems  2.1  and  5.1  of  Billingsley, 

IS '3).  The  following  lemna  is  the  analogue  of  the  converse  part  of  Prohorov's 
Theorem  (Billingsley,  1968,  Theorem  6.2)  and  with  Lemma  2.5  shows  that  for 
Polish  spaces  the  LDP  is  equivalent  to  the  VTLDP  and  LD  tightness.  The  proof 
is  similar  to  Billingsley's  proof  of  Prohorov's  Theorem  and  can  be  found  in 
Lynch  and  Sethuraman  (1984) . 

Lemma  2.6.  If  {P„}  is  a  sequence  of  probability  measures  which  satis- 
■ r  n 

fies  the  LDP,  then  (Pn)  is  LD  tight. 

The  earliest  example  of  the  LDP  is  when  ?n  is  the  probability  measure 
induced  by  the  average  of  n  i.i.d,  observations.  It  is  summarized  in  the 
following  theorem  which  is  variously  referred  to  as  Cramer's  Theorem  and 
Chemoff's  Theorem. 

0X1 

Let  Xp  X^,  ...  be  i.i.d.  with  m.g.f.  <(>(0)  =  E(e  ).  Let  ^(e)  85  log<f>(0) 
denote  the  cumulant  generating  function  of  X^  and  define 

(2.10)  JY(a)  =  J(a)  *  sup  (ea-^(0)}. 
x  e 

Let  Pn  denote  the  probability  measure  induced  by  =  (Xj+. . .+Xn)/n.  Then, 
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Theorem  2.7.  (Cramer,  1938;  Chemoff,  1952).  The  probability 
measures  {PnJ  satisfy  the  LDP  with  rate  function  J(*)  • 

The  following  results  show  how  LD  properties  for  marginal  measures  carry 
over  to  the  product  measures.  These  are  needed  in  the  next  section.  The 
proof  of  Lemma  2.8  is  obvious  while  that  of  Lemma  2.9  may  be  found  in  Lynch 
and  Sethuraman  (1984). 

Let  (Pn*)  he  a  sequence  of  probability  measures  on  a  Polish  space  X  , 

12  12 
i  =  1,2.  Let  P  =  P  x  P  be  the  product  measure  on  X  *  X  x  X  . 

’  n  n  n 

Lemma  2.8.  If  {P  x)  is  LD  tight  for  i  =  1,2,  then  (Pn>  is  LD  tight. 

Lemma  2.9.  Let  {P1}  satisfy  the  VJLDP  with  rate  function  I1(xi), 
i  =  1,2.  Then  (Pn>  satisfies  the  IVLDP  with  rate  function  I(x1>  x2)  =  I1(x1) 

♦  IZ(x2). 

The  following  corollary  follows  from  Lemmas  2.6,  2.8  and  2.9. 

Corollary  2.10.  Let  {P^}  be  LD  tight  and  satisfy  the  VJLDP,  i  =  1,2. 

i  2 

Then  P„  =  P  "  x  P  satisfies  the  IDP  with  rate  function  I (x. ,x7)  = 
n  n  n  it 

^(Xj)  +  I2(x2). 

3.  The  LDP  for  the  Sample  Average  Process.  Let  Xj,  X2  ...  be 

i.i.d.  r.v.’s.  life  assume  that  the  m.g.f. ,  <f>(0)>  is  finite  in  some  open 

interval  about  zero.  Thus  the  mean  is  finite  and  we  will  without  loss  of 

generality  assume  that  E(X^)  =  0.  Recall  that  from  (2.10)  the  large 

deviation  rate  for  "X  is  J(a)  =  sup{ea-log<|>(e)}. 

n  0 

To  state  precisely  the  LDP  for  the  sample  average  process  X~n(°)  t  where 

^(t)  *  P(nt)/n  we  make  the  following  digression.  V'e  note  that  J(*)  is  convex 

with  its  minimum  zero  at  zero  since  E(Xj)  =0.  So,  J(a)/a  is  increasing 

(decreasing)  in  a  >  0  (a<0).  Let  C-,  *  lim  J(a)/a  and  C2  «*  lim  J(-a)/a. 

a a-*» 
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Let  BV[0,13  denote  the  space  of  functions  which  are  right  continuous 
and  of  bounded  variation  on  [0,13  endowed  with  the  weak*  topology  -  better 
known  as  the  topology  of  weak  convergence  or  convergence  in  distribution 
amongst  statisticians  and  probabilists.  Let  C[0,1]  denote  the  space  of 
continuous  function  on  [0,1]  with  the  usual  uniform  topology. 

For  feBVtO,l],  let  f  =  h-,  -  denote  the  Hahn- Jordan  decomposition  of 
f,  let  f  denote  the  Radon-Nikodym  derivative  of  its  absolutely  continuous 
part  and  let  h|  and  h|  denote  the  singular  parts  of  and  Let 
1(f)  ■  ^  J(f)dt  +  C^h£l0,13  +  C2h|[0,13  and  feBV[0,13  and  =  °°  if  not,  where  we 
adopt  the  convention  that  0*“  =  0.  In  particular,  if  Cj  =  »  =  C2  then 
1(f)  =  J(f)dt  if  f  absolutely  continuous  and  ■»  if  not. 

If  either  or  C2  is  finite  (Case  1)  we  must  view  5T  (•)  as  taking  values 
in  BV[0,13.  If  Cx  and  C2  are  both  infinite  (Case  2)  we  may  view  Xn(*)  as 
taking  values  in  either  BV[0,13  or  C[0,13.  The  reason  we  must  distinguish 
these  two  cases  is  that  ?a  =  (f:I(f)  s a)  is  not  a  compact  subset  of  C[0,13 
when  Case  1  obtains. 

Theorem  3.1.  Let  Pn  denote  the  probability  measure  induced  by  ^n(*)» 

Then  (Pn>  satisfies  the  LDP  with  rate  function  I (•) . 

The  proof  that  I(-)  is  a  regular  rate  function  can  be  fotmd  in  Lynch 
and  Sethuraman  (1984)  for  Case  1  and  in  Varadhan  (1966)  for  Case  2  (c  .'f . 
Groeneboom  et  al.,  1979).  Here  we  only  outline  the  proof  of  the  upper  and 
lower  probability  bounds  in  the  principle. 

To  proof  of,  those  bounds  is  somewhat  technical  but  in  reality  involves 
only  three  key  ideas.  First  the  process  ^(»)  is  approximated  by  a  finite 
dimensional  random  vector  whose  components  are  independent.  Then  Corollary 
2.10  with  Theorem  2,7  and  the  contraction  principle  establishes  the  LDP  for 
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the  finite  c&ransional  process.  Finally,  Lemma  3.2  and  Theorem  3.3,  belov, 

show  that  the  approximations  are  suitable  enough  to  establish  the  principle. 

We  need  the  following.  For  a  partition  P  -  {0  =  tg  <  t^  <  . .  •  <  t^  =  1}  let 

^t  =  tA  -  tj.j  and  Atf  =  fftj)  -  f(ti.1).  Let  Ip(f)  =  i 

Analogous  to  (2.5)  let  I„(A)  =  inf  Ip(f).  Let  A  =  ess  inf  X,  and 

P  feA  V 

B  -  ess  sup  Xj.  Note  that  if  A  is  finite  then  J(A)  =  -logP(Xj  =  A).  A 

similar  statement  about  J(B)  can  be  made  if  B  is  finite.  Also  note  that  J(0 

is  continuous  an  LA*B]  (even  if  A  or  B  is  infinite)  where  if  J(A)  =  »,  the 

lim  J(a)  a  «  etc.  Thus  Ip(*)  is  continuous  on  [A,B]  . 
a+A  y 

Let  B  denote  the  BoreL_o- field  in  C0,1]. 

Lemma  3.2.  Ip(f)  +  1(f)  as  o(P)  t  8. 

Theorem  3.3.  (The  Minimax  Theorem)  if  F  is  closed,  then  sup  IP(F)  “1(F). 

P 

Remark  3.4.  The  proof  of  Lemma  3.2  depends  on  the  fact  that  (fp,  a  (P)>, 
where  fp  =  E(fp(U)  j  o(P))  and  U  is  uniform  on  [0,1],  is  a  martingale  and  can 
be  found  in  Lynch  and  Sethuraman  (1984) .  A  proof  of  the  minimax  theorem  for 
Case  1  can  also  be  found  there.  The  proof  for  Case  2  is  similar  (c.f. 
Groeneboon  et  al.,  1979,  Lemma  2.4). 

Outline  -  lower  bound.  We  only  do  the  proof  for  Case  1.  The  proof  for 
Case  2  is  similar  but  somewhat  more  complicated  and  may  be  found  in  Varadhan 
(1966) . 

Let  G  be  an  open  set.  If  1(G)  =  ®  there  is  nothing  to  prove.  So  assume 

that  1(G)  <«.  For  e>0  choose  f  such  that  1(f)  <  1(G)  ♦  e.  There  is  a 

partition  P  =  (0  ■  tp  <  tj  <  . . .  <  t.  *  1)  of  continuity  points  f  such  that 

Np  =  (g:max  |  A.g  a  f  |  ^  r-  G. 

r»e  i<V  A  x 
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Since  £  is  continuous  at  t^,  for  all  sufficiently  large  n,  (Xn(,)eNp  } 


Cnt.  3 


*  &“-£-)  '  £C°))l<^|XCntl3  +  1/n|  <e’KS[nt23  ' 

Cnt7]  Cnt,  3  X[nt7]+1 

(f  (--—-)  -  f(— -— ))[  <  e,  — ~  <  e,  etc.).  So  for  such  an  n,  by 


independence , 


(3.1)  P(y.)  e  N  p^)s  n  PC 


k- l  fnt4 3- Cnt . 3+1  Cnt .  . 3 

■  p( - 1  V  -  (*Hr^  - 


Since  (J-^  <e)+l  as  n-.«. 


by  (3.1)  and  Theorem  2.7, 


lim  n'hogPCCp^OcNp^)  a  -Ip(Np>e)  a  -Ip(f)  *  -1(f) 

where  the  last  inequality  follows  from  Lemma  3.2  and  the  second  to  last 
from  the  definition  of  Ip(-) . 


Outline  -  the  upper  bound .  Let  F  be  a  closed  set.  Fix  a  partition 
P  =’  {0  *  *0  <  *1 <  *  *  *  <  *k  ”  £or  assume  1:  *  2.  Note  that, 

from  the  definition  of  Ip(0 , 

P(y  OeF)  s  P(Ip(7^(0)  *  Ip(F)) . 

Thus,  to  establish  the  upper  bound  it  suffices  to  show  that 

(3.2)  Hm  n'1logP(Ip(^(.))  a  a)  s  -a. 

Now  the  event 

8 

(IpCL(*)  a  a)  eu  Ajn 

r  11  j«l 


(3.3). 
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whereAln“  {X[nti:i  +  Is0'  SCnt1]  +  l2°»  811(1  W'^ia} 

and  A^,  ...»  A^  denote  the  other  possible  sets  resulting  from  the  seven 

other  choices  of  the  first  three  inequalities.  For  A^,  let  K  be  a  fixed 

finite  positive  number  less  than  or  equal  to  B.  Then,  since  J  is  nonnegative 

and  increasing  in  a  >  0  and  convex 


Since  J  is  continuous  on  [A,BD  and  0  <  K  s  B  it  follows  from  Theorem 
2.7,  Corollary  2.10  and  the  contraction  principle  that 

(3.5)  T3m  n"1  logPns -a. 

By  Markov's  inequality, 

(3.6)  n'Hog^s  -tj6K+  ■*  -tjSK 

as  n-*-®  for  every  0  for  which  y(q)  <  ®.  If  B  <  ®,  then  y(e)  <  ®  for  all  6  >  0, 
while  if  B»«,  then  K  may  be  taken  arbitrarily  large.  In  any  event  (3.4) 
combined  with  (3.5)  and  (3.6)  and  the  above  observation  show  that 
lim  n  ”1logP(Aln)  s  -a.  This  with  (3.3)  proves  (3.2)  and  completes  the 
proof  of  the  upper  bound. 

4.  Functional  E-R  Laws.  With  the  formulation  the  same  as  in  Section 
3,  let  I  (•)  denote  the  rate  function  for  1^(0  and  recall  the  definition 
of  in  the  Introduction.  As  before  ra"  (f:I(f)  sa). 
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For  a  set  A,  let  Ag  =  (g:d(f,g)  <  e  for  sane  f  e  A}  where  these  sets 
are  viewed  as  subsets  of  BV[0,1]  (Case  1)  or  CC0,13  (Case  2)  and  d  is  either 
the  metric  which  induces  the  weak*  topology  (Case  1)  or  the  uniform  topology 
(Case  2). 

The  functional  analogues  of  Shepp's  (1964)  and  Erdos  and  Renyi's  laws 
can  be  stated  as  follows. 

Theorem  4.1,  The  set  of  cluster  points  of  {a  _(•))  is  r  • 

Theorem  4.2.  Let  e>0.  With  probability  1, 

(i)  n  a(0:  msn)  c- ra  e  eventually 

and 

<il}  rac  fV,n,a(-’:  m£n,t- 

(iii)  In  particular,  the  set  of  cluster  points  of  the  triangular  array 
(Vn,a(i):  msn}  “  ra* 

To  prove  these  theorems  we  need  the  following  lemma. 

Lemma  4.3.  (i)  For  each  e  >  0  there  exists  a  c  >  a  for  which  r_  c  r_ 

C  3 fC 

and  (ii)  r  equals  the  closure  of  {f:I(f)  <a). 

Proof,  (i)  Suppose  not.  Then  for  every  c>a  there  exists  an  f  er 

with  fc^ra,e.  Fix  b>a.  Since  rb  is  compact  the  net  (fc:a<c<b)  has  a 

subnet  {f^}  which  converges,  say,  to  fQ,  as  dia.  By  the  lsc  of  I(*)  it 

follows  that  I(f0)  *  lim  I(f ,)  sa,  and  so  ffter_.  But  f  It.  r  implies  that 
v  d+a  °  u  a  c  a,e 

d(f0,fc)  >  c  for  every  c  which  contradicts  that  {f^}  converges  to  fp. 
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(ii)  Fix  fer  with  1(f)  ■  a.  It  suffices  to  show  that  I  (of)  <a  for  0<a<l. 

a 

This  is  iraiediate  since  I  is  convex  with  1(0)  *0,  where  0  denotes  the  func¬ 
tion  which  is  identically  zero.  || 

Proof  of  Theorems  4.1  and  4.2.  Fix  a  a  0  and  e  >  0.  Vfe  first  show  that 

(4.1)  for  any  fera,  d((A_  „  a(*),f)  <  e)  infinitely  often  a.s. . 

a  n,n,a 

Let  0= (g:  d(g,f)  <  e).  Note  that  since  0  is  open,  a-feO  for  some 
ae(0,l).  By  the  LDP, 

lim  n'Hog  PO^eO)  i-I(O)  2:-I(af). 

Since  I(»)  is  convex  with  1(0)  *0  and  ae(0,l),  I(af)  <al(f)saa.  Thus, 

(4*2)  P^,n,a^’^c0)  in‘a(1+°(1))* 

Since  m"1  =  «>,  for  a'e(a,l)  fixed  it  follows  from  Lerana  (3.1)  of  Shepp  (1964) 

that  there  exists  a  sequence  (n(k)}  with  n(k+l)  =n(k)  +  [a  Hog  n(k)  ]  such 

» 

that  m(k)  -<*>.  This  with  (4.2)  shows  that  a(*)c0)  =  ®. 

Statement  (4.1)  follows  from  this  and  the  divergent  part  of  the  Borel-Cantelli 
lemma  since  the  events  (An^  a( *) e0} ,  k  =  1,  2,  ...  are  independent. 

Vfe  now  show  that,  with  probability  1, 

(4.3)  {Vn,a(*):  msn>cra,e  eventually* 

For  all  sufficiently  large  integer  k,  let  Keeak+1  and  let  l(k)  =  K-l 
if  K  is  an  integer  and  *  CK)  if  not.  Then,  since  k  «*  [a* *log  k)  for 

l(k-l)  <ns  100 ,  Ajn,n,a^  *  *m,l(k)  ,a^  for  m * n*  t0  Prove  *3^ * 
it  suffices  to  shew  that 

P^Am,l(k)  a^Her0  c  *°T  som  msl(k)i.o.)  *°* 


(4.4) 
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By  Lemma  4.3,  (i)  there  exists  a  c>a  such  that  rccra>e*  Thus 

I(rf  )  2C.  So  by  the  LDP, 
a,e 

TE  n_1log  P(^C*)er^e)  s  -l(rj#e)  *  -c 

since  r  is  closed  with  r*  =r  c.  Thus,  for  k'  sufficiently  large, 
a,c  a,£  c 

1*k?(4k.l<k),aM*tS,.  £or  sore 

S1  ib^W)).  £  ey»-c)(i.o»)), 
fc>k*  k*k‘ 

So  (4.4)  follows  from  the  convergent  part  of  Borel-Cantelli  lemma.  This 
completes  the  proof  of  (4.3). 

Since  r  is  closed,  r  +r  as  e+0.  This  with  (4.1)  and  (4.3)  completes 

a  cL9£ 

the  proof  of  Theorem  4.1  and  (i)  of  Theorem  4.2. 

To  prove  (ii)  of  Theorem  4.2,  let  e>0.  Then,  by  Lemma  4.3  (ii),  there 

is  a  finite  collection  {£.,...  ,f,  )  c  r  such  that  r  c  fr  If  - ) 

l  k  a  a  i»i  1  e/2 

Since  (f. }  is  an  open  set,  by  the  LDP, 

1  c/2 

lta  n’hog  P(\  c  fft)e/2)  a -IC{fi>V2)  a 
So, 
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SAPtW  „/  ‘‘I1.//"7'3  l0g"D'1 
.j.a-p(x[a-1iog„1(f1)£/2,)tn/[a'lloenl)'1 
5  r  (i.e'I(£i)la“l10"  nKi+0(i))(n/[a  llog  n]) 


i=i 


k  (-I(fi)a’1)(l+0(l))/a~Al°g  n 
sU11  1 


-1, 


i=i 


-x 


where  in  the  last  inequality  we  used  that  (1-x)  <  l 

Since  I(f.)  <a  for  i  =  i,...,k,  it  follows  that  z  p(ra^\>n>a:mSnJe  )<  “• 

This  with  the  convergent  part  of  the  Borel-Cantelli  lemma  completes  the 
proof. 
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